DHTJoin: Processing Continous Join Queries using DHT Networks
نویسندگان
چکیده
This paper addresses the problem of computing approximate answers to continuous join queries. We present a new method, called DHTJoin, which combines hash-based placement of tuples in a Distributed Hash Table (DHT) and dissemination of queries exploiting the trees formed by the underlying DHT links. DHTJoin distributes the query workload across multiple DHT nodes and provides a mechanism that avoids indexing tuples that cannot contribute to join results. We provide a performance evaluation which shows that DHTJoin can achieve significant performance gains in terms of network traffic.
منابع مشابه
Processing continuous join queries in qtructured P2P systems. (Traitement de requêtes de jointures continues dans les systèmes pair-à-pair (P2P) structurés)
Recent years have witnessed the growth of a new class of data-intensive applications that do not fit the DBMS data model and querying paradigm. Instead, the data arrive at high speeds taking the form of an unbounded sequence of values (data streams) and queries run continuously returning new results as new data arrive. In these applications, data streams from external sources flow into a Data S...
متن کاملP2P Join Query Processing over Data Streams
De nombreuses applications distribuées partagent la même nécessité de traiter des flux de données de façon continue, par ex. la surveillance de réseau ou la gestion de réseaux de capteurs. Dans ce contexte, un problème important et difficile concerne le traitement de requêtes continues de jointure qui nécessite de maintenir une fenêtre glissante sur les données la plus grande possible, afin de ...
متن کاملTowards a Unifying Framework for Complex Query Processing over Structured Peer-to-Peer Data Networks
In this work we study how to process complex queries in DHT-based Peer-to-Peer (P2P) data networks. Queries are made over tuples and relations and are expressed in a query language, such as SQL. We describe existing research approaches for query processing in P2P systems, we suggest improvements and enhancements, and propose a unifying framework that consists of a modified DHT architecture, dat...
متن کاملEfficient Index-based Processing of Join Queries in DHTs
Massively distributed applications require the integration of heterogeneous data from multiple sources. Peer-to-peer (P2P) is one possible network model for these distributed applications and among P2P architectures, distributed hash table (DHT) is well known for its routing performance guarantees. Under a general distributed relational data model, join query operator, an essential component to...
متن کاملSupporting Efficient Recursive Query Processing in DHTs
Although various approaches have been proposed to process complex queries, such as range queries, join queries, aggregate queries and etc., in Peer-to-Peer data management research, we find currently there are no efficient techniques to support linear recursive queries, which is also one indispensable types of queries and exhibits crucial importance in many scenarios. In this paper, we propose ...
متن کامل